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Abstract 

Priebe et al. (2001) introduced the class cover catch digraphs and computed the distribution of the 
domination number of such digraphs for one dimensional data. In higher dimensions these calculations 
are extremely difficult due to the geometry of the proximity regions; and only upper-bounds are available. 
In this article, we introduce a new type of data-random proximity map and the associated (di)graph in 
R''. We find the asymptotic distribution of the domination number and use it for testing spatial point 
patterns of segregation and association. 
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1 Introduction 



In a digraph D = (V,^) with vertex set V and arc (directed edge) set A, a vertex v dominates itself and 
all vertices of the form {u : vu G A}. A dominating set, Sd, for the digraph D is a subset of V such that 
each vertex w e V is dominated by a vertex in So- A minimum dominating set, S'^, is a dominating set 
of minimum cardinality; and the domination number, "f{D), is defined as "f{D) := {S'^l, where | • | is the 
cardinality functional ( [West, 2001| ). If a minimum dominating set is of size one, we call it a dominating 
point. 

Let {fl,A4) be a measurable space and consider a function N : Q x 2^ ^ 2^, where 2^ represents the 
power set of fl. Then given y C fl, the proximity map Ny{-) = N{-,y) : — > 2^^ associates with each point 
a; G a proximity region Ny(x) C ^l. The region Ny(x) depends on the distance between x and y. For 
B cn, the Ti-region, ri(-) = ri{-,Ny) : fl —> 2^^ associates the region Ti{B) := {z £ fl : B C Ny{z)} with 
each set B Cil. For x e 17, we denote Fidx}) as ri(a;). 

If Xn ~ {Xi,X2,- ■ ■ ,Xn} is a set of il-valued random variables, then the Ny{Xi) (and Ti{Xi)), i = 
1, • • • ,n are random sets. If the Xi are independent and identically distributed, then so are the random 
sets Ny{Xi) (and ri{Xi)). Furthermore, ri{X„) is a random set. Notice that ri(A'„) = r\^^iTi{Xj), since 
X e Ti{Xn) iff Xn C Ny{x) iff Xj € Ny{x) for all j = l,...,n iff .t € Ti{Xj) for aU 7 = iff 
x&n^^,T,{Xj). 

Consider the data-random proximity catch digraph D with vertex set V = and arc set A defined by 
{Xi,Xj) E A <;=^ Xj E Ny{Xi). The random digraph D depends on the (joint) distribution of the Xi and 
on the map Ny (see Priebe et al. (2001) and Priebe et al. (2003)). The adjective proximity — for the catch 
digraph D and for the map Ny — comes from thinking of the region Ny{x) as representing those points in 
n "close" to X (see, e.g., Toussaint (1980) and Jaromczyk and Toussaint (1992)). 

For Xi , Xn F the domination number of the associated data-random proximity catch digraph 
D, denoted F, Ny), is the minimum number of points that dominate all points in X„. Note that, 

7(A'„; F, Ny) = 1 iff A-^ n ri(A'„) ^ 0. 

The random variable j„ li'^n] F, Ny) depends on explicitly, and on F and Ny implicitly. In 
general, the expectation E [7„], depends on n, F, and Ny, 1 < E [7„] < n; and the variance of 7„ satisfies, 
< Var [7„] < n'^/A. 

We can also define the regions associated with 7„ = for k < n. For instance, the r2-region for proximity 
map Ny{-) and set B C is T2{B) = {{x, y) E[n \ Ti{B)Y : B C Ny{x) U Ny{y)}. In general, 

Tk{B) = {{xi,X2, ■ ■ ■ , Xk) E fl'' : B C- u'^^iNy{xj) and all possible m-permutations (ui, U2, . . . , Um) 

of {xi, X2, ■ ■ ■ , Xk) satisfy (ui, U2, ■ ■ ■ , u,„) ^ Tm{B) for each m — 1,2, . . . ,k — 1}. 



2 A Class of Proximity Maps and the Corresponding Fi-Regions 

Let = and let y = {yi,y2,y3} C be three non-coUinear points. Denote by T{y) the triangle 
— including the interior — formed by these three points. The most straightforward extension of the data 
random proximity catch digraph introduced by Priebe et al. (2001) is the spherical proximity map Ns{x) = 
B(x,r(x)) which is the ball centered at x with radius r{x) = miny^y d{x,y) or the arc-slice proximity map 
Nas{x) = B{x,r{x)) n T{y). However, both cases suffer from the intractability of the Fi-region and hence 
the intractability of the finite and asymptotic distribution of 7„. We propose a new class of proximity regions 
which does not suffer from this drawback. 

For r E [l,oo] define Ny to be the r-factor proximity map and F^^ to be the corresponding Fi-region 
as follows; see also Figures [T] and [H Let "vertex regions" i?(yi), Riy-i), Riys) partition T{y) using seg- 
ments from the center of mass of T{y) to the edge midpoints. For x E T{y) \ y, let v{x) € 3^ be the 



vertex whose region contains x; x G R{v{x)). If x falls on the boundary of two vertex regions, we assign 
v{x) arbitrarily. Let e{x) be the edge of T{y) opposite v{x). Let £{v{x),x) be the line parallel to e{x) 
through X. Let d{v{x),i{v{x),x)) be the Euclidean (perpendicular) distance from v{x) to £{v{x),x). For 
r £ [1,00) let £r{v{x),x) be the Hue parallel to e{x) such that d{v{x),£r{v{x),x)) = rd{v{x) , £{v{x) , x)) and 
d{£{v{x),x),£r{v{x),x)) < d{v{x),£r{v{x),x)). Let Tr{x) be the triangle similar to and with the same ori- 
entation as T{y) having v{x) as a vertex and £r{v{x), x) as the opposite edge. Then the r-factor proximity 
region N^{x) is defined to be Tr{x) n T{y). 

To define the Fi-region, let £,j{x) be the Hue such that ^j{x)r\T{y) ^ and rd{yj,^j{x)) = d{yj , £{yj , x)) 
for j = 1,2,3. Then r[{x) = U^^^ (L^ (x) n i?(yj)) where r^(x) n i?(yj) = {z e Riyj) : diyj,£{yj, z)) > 
d{yj,^j{x)}, for j = 1, 2, 3. Notice that r > 1 impUes x S Ny{x) and a; e ^i{x)- Furthermore, limr^oo Ny{x) = 
T(y) and lim^^oo F^a;) = T{y) for all a; e T{y) \ y, and so we define N^{x) = T{y) and Ff=(a;) = T(J^) 
for all such x. For x G 3^, we define Ny{x) = {x} for all re [1, 00]. 

Notice that Xi *~ i^, with the additional assumption that the non-degenerate two-dimensional probability 
density function / exists with support(/) C T{y), impHes that the special case in the construction of Ny 
— X falls on the boundary of two vertex regions — occurs with probability zero. Note that for such an F, 
Ny{x) is a triangle a.s. and F^(x) is a star-shaped polygon (not necessarily convex). 




Figure 1: Construction of r-factor proximity region, Ny{x) (shaded region). 

Let Xe := aTgmmx^x„ d{X, e) be the (closest) edge extremum for edge e. Then ri{X„) = n^^;^F^(Xe^. ), 
where ej is the edge opposite vertex y^, for j = 1,2,3. So F^^ (<%"„) n R{yj) = {z £ R{yj) ■ d{\/ j , £{y j , z) > 
d(y„C,(XeJ)}, for j = 1,2,3. 

Let the domination number be 7ri(r) := 7„(A'„; i^, A^^^) and := &ig'minx^x„r]R{yj)'^{'^T^j)- Then 
Inir) < 3 with probability 1, since Xn n R{yj) C Ny{Xy^) for each j = 1, 2, 3. Thus 

1 < E [7„(r)] < 3 and < Var [7„(r)] < 9/4. 



Figure 2: Construction of the Fi-region, (x) (shaded region). 



3 Null Distribution of Domination Number 

The null hypothesis for spatial patterns have been a contraversial topic in ecology from the early days. 
[Gotelli and Graves, 1996] have collected a voluminous literature to present a comprehensive analysis of the 
use and misuse of null models in ecology community. They also define and attempt to clarify the null model 
concept as "a pattern-generating model that is based on randomization of ecological data or random sampling 
from a known or imagined distribution. . . . The randomization is designed to produce a pattern that would 
be expected in the absence of a particular ecological mechanism." In other words, the hypothesized null 
models can be viewed as "thought experiments," which is conventially used in the physical sciences, and 
these models provide a statistical baseline for the analysis of the patterns. For statistical testing, the null 
hypothesis we consider is a type of complete spatial randomness; that is, 

Ho:X,::i^U{T{y)) 

where U{T{y)) is the uniform distribution on T{y). If it is desired to have the sample size be a random 
variable, we may consider a spatial Poisson point process on T{y) as our null hypothesis. 

We first present a "geometry invariance" result which allows us to assume T{y) is the standard equilateral 
triangle, r((0, 0), (1, 0), (1/2, V3/2)) , thereby simplifying our subsequent analysis. 

Theorem 1: Let y = {yi,y2,y3} C be three non-coUinear points. For z = 1, • • • , n, let 
F ~ U{T{y)), the uniform distribution on the triangle T{y). Then for any r e [l,oo] the distribution of 
'y{Xn'-,^^{T{y)), Ny) is independent of y, and hence the geometry of T{y). 

Proof: A composition of translation, rotation, reflections, and scaling will take any given triangle To ~ 
T({yi,y2,y3}) to the "basic" triangle = T({(0, 0), (1, 0), (ci, ca)}) with < ci < 1/2, C2 > and (l-ci)2 + 
C2 < 1, preserving uniformity. The transformation 0e : — * given by if)e{u,v) = + ^^^^ v, 2% 

takes Tf, to the equilateral triangle = T({(0,0), (1,0), (1/2, V3/2)}). Investigation of the Jacobian shows 
that also preserves uniformity. Furthermore, the composition of 0e with the rigid motion transformations 
maps the boundary of the original triangle. To, to the boundary of the equilateral triangle, Te, the median 
lines of To to the median lines of Te, and lines parallel to the edges of To to lines parallel to the edges of Tg. 
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Table 1: The number of 7„(3/2) = fc out of iV = 1000 replicates. 



Since the distribution of 'y{Xn;L{{T{y)), Ny) involves only probability content of unions and intersections 
of regions bounded by precisely such lines, and the probability content of such regions is preserved since 
uniformity is preserved, the desired result follows. ■ 

Based on Theorem 1 and our uniform null hypothesis, we may assume that T{y) is a standard equilateral 
triangle with y = {(0,0), (1,0), (l/2,^/3/2)} henceforth. 

For our r-factor proximity map and uniform null hypothesis, the asymptotic null distribution of "fn{f) '■= 
j{Xn;U{T{y)), N^) can be derived as a function of r. We denote by i,^ := {z e T{y) : N^{z) = T{y)} the 
superset region associated with Ny in T{y). Notice that c^y C T\{Xn) for all r and XnOi^y ^ ^ implies that 
l7i{r) = 1. 



Proposition 1: The expected area of the the Fi-region, E [^(r5^(A:'„))], converges to the area of the 

y) 

Proof: See Appendix. 



superset region, A{(;y), as n ^ oo. In particular, E [A(r'^^^(A'„))], goes to zero at rate 0{n ^) as n ^ oo 



As a corollary to the above proposition, we have that E [y4(r5^ (<%"„))] ^i^y) = for ^ ^ [I73/2] as 
n ^ 00. Additionally, E [^(rj (<%-„))] ^ A{^^) = (1 - 3/(2 r))^ ^3 for r e (3/2,2], and E[A{r[{Xn))] ^ 
A{c^^) = 73/4 (1 - 3/r2) for r e (2, 00], as n —I- 00. 

Theorem 2: The domination number 7n(?') = 'y{Xn;U{T{y)), Ny) is degenerate in the limit for r G 
[1, 00] \ {3/2} as n 00. 

Proof: For r G [1,3/2), = and T{y) \ N^{Xy]) has positive area for ah j = 1,2,3. Furthermore, 
^"(3^) \ iNyiX[j]) U N^{X[),])) has positive area for all pairs {j,k} C {1,2,3}. Recah that 7„(r) < 3 with 
probability 1 for all n and r. Hence 7n(7') — > 3 in probability as n — s- cxj. 

For r G (3/2, 00], <;y has positive area, so 7n(f) ^ 1 in probability as n — 00. ■ 

Theorem 3: For r = 3/2, lim„^oo7ri(^) > 1 a.s. In particular 

hm 7n(3/2) 

Thus E [7„(3/2)] ^ ^ « 2.2587 as n ^ 00, and Var [7„(3/2)] ^ ct^ « .1918 as n 00. 
Proof: See Appendix. ■ 

The finite sample distribution of 7„(3/2), and hence the finite sample mean and variance, can be obtained 
by numerical methods. We estimate the distribution of 7„(3/2) for various fixed n empirically. In Table[l] 
we present empirical estimates for n = 10, 20, . . . , 100, 200, 300 with 1000 Monte Carlo replicates. See also 
Figured 

Theorem 4 Let 7„(r) = j{Xn;U{T{y)), N^). Then n < ra implies 7„(r2) <^'^ 7„(ri). 

Proof: Suppose n < rj. Then P(7«(?-2) = 1) > Pilnin) = 1) and P(7«(r2) = 2) > P{jn{ri) = 2) and 
P{ln{r2) = 3) < P(7„(?'i) = 3). Hence the desired result follows. ■ 
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Figure 3: Plotted are the empirical estimates of P(7„(3/2) = k) versus various n values. 

4 The Null Distribution of the Mean Domination Number in the 
Multiple Triangle Case 

Suppose 3^ is a finite collection of points in M.'^ with |3^| > 3. Consider the Delaunay triangulation (assumed 
to exist) of y, where Tj denotes the j*'' Delaunay triangle, J denotes the number of triangles, and Cniy) 
denotes the convex hull of y (Okabe et al. (2000)). We wish to investigate 

Ho:X,'^U{CH{y)) 

against segregation and association alternatives (see Section [5]). 

Figure 3] presents a realization of 1000 observations independent and identically distributed according to 
UiCniy)) for \y\ = 10 and J = 13. 
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Figure 4: A realization of Hq for |3^| = 10, J = 13, and n 1000. 

The digraph D is constructed using Ny,{-) as described above, where for Xi g Tj the three points in y 
defining the Delaunay triangle Tj are used as yj . Let 7„^- (r) be the domination number of the component 
of the digraph in Tj, where rij = \Xn H Tj\. 



Theorem 5: (Asymptotic Normality) Suppose rij ^ 1 and J is sufficiently large. Then the null 
distribution of the mean domination number Gj -j X]/=i 7nj(3/2) is given by 

77 approx . 2 / j\ 

Gj - A/(Ai,cr /J) 

where /i and cr^ are given in Theorem 3 above. 

Proof: For fixed J sufficiently large and each nj sufficiently large, j^. (3/2) arc approximately indepen- 
dent identically distributed as in Theorem 3. ■ 

Figure O indicates that, for J = 13 with the realization of y given in Figure 2] and n = 100 the normal 
approximation is not appropriate, even though the distribution looks symmetric, since not all rij are suf- 
ficiently large, but for n — 1000 the histogram and the corresponding normal curve are similar indicating 
that this sample size is large enough to allow the use of the asymptotic normal approximation, since all Uj 
are sufficiently large. However, larger J values require larger sample sizes in order to obtain approximate 
normality. 




Figure 5: Depicted are Gj 7V'(/i w 2.2587, tiV J w .1917/ J) for J = 13 and n = 100 (left) n = 1000 

(right). Histograms are based on 1000 Monte Carlo replicates and the curves are the associated approximating 
normal curves. 

For finite n, let Gj{r) be the mean domination number associated with the digraph based on Ny. Then 
as a corollary to Theorem 4 it follows that for ri < 7-2, we have Gj{r2) Gj{ri). 

5 Alternatives: Segregation and Association 

In a two class setting, the phenomenon known as segregation occurs when members of one class have a 
tendency to repel members of the other class. For instance, it may be the case that one type of plant does 
not grow well in the vicinity of another type of plant, and vice versa. This implies, in our notation, that Xi 
are unlikely to be located near any elements of y. Alternatively, association occurs when members of one 
class have a tendency to attract members of the other class, as in symbiotic species, so that the Xi will tend 
to cluster around the elements of y, for example. See, for instance, |Dixon, "19941 , ICoomes et al., 1999| . 

We define two simple classes of alternatives, and with e e (0, \/3/3), for segregation and asso- 
ciation, respectively. Let J^e = {(0, 0), (0, 1), (1/2, V3/2)} and = T(3^e). For y e y^, let e(y) denote the 
edge of Te opposite vertex y, and for x £ let ^y(x) denote the line parallel to e(y) through x. Then define 

r(y, e) = {x e Te : d(y, £y{x)) < e}. Let be the model under which X, - U{Te \ UyeyT{y, e)) and be 

the model under which Xi hl{Uy(=yT{y, a/3/3 — e)). Thus the segregation model excludes the possibility 
of any Xi occurring near a yj, and the association model requires that all Xi occur near y^'s. The a/3/3 — £ 
in the definition of the association alternative is so that £ = yields Hq under both classes of alternatives. 



Remark: These definitions of the alternatives are given for the standard equilateral triangle. The 
geometry invariance result of Theorem 1 still holds under the alternatives, in the following sense. If, in an 
arbitrary triangle, a small percentage 6 ■ 100% where 6 S (0,4/9) of the area is carved away as forbidden 
from each vertex using line segments parallel to the opposite edge, then under the transformation to the 
standard equilateral triangle this will result in the alternative j——. This argument is for segregation; a 



/3<5/4 

similar construction is available for association. 

Theorem 6: (Stochastic Ordering) Let 7„,e(r) be the domination number under the segregation alter- 
native with e > 0. Then with G (0, \/3/3), j = 1, 2, ei > 6-2 implies that 7„,ei(3/2) 7„^e2(3/2). 

Proof: Note that P(7„,,,(3/2) 1) > P(7„,e.(3/2) = 1) and P(7„,ei(3/2) = 2) > P(7„,,,(3/2) = 2), 
hence the desired result follows. ■ 

Note that for Theorem 6 to hold in the limiting case, ei € (0, VS/A] and es e (%/3/4, V3/3) should hold. 
For £ g (0,^3/4], 7„,e(3/2) ^ 2 in probability as n ^ oo, and for s G (\/3/4, \/3/3), 7,i,e(3/2) 1 in 
probability as n — > cxd. 

Similarly, the stochastic ordering result of Theorem 6 holds for association for all e and n < c», with the 
inequalities being reversed. 

Notice that under segregation with e e (0, \/3/4), 7n.e(f) is degenerate in the limit except for r = 
(3 — \/3e)/2. With e € (•\/3/4, '\/3/3), 7n,e(r) is degenerate in the limit except for r ~ \/?i/e — 2. Under 
association with e € (0, -\/3/4), 7,1, £(7") is degenerate in the limit except for r = 2(1-^^) ■ 

The mean domination number of the proximity catch digraph, Gj := j Irij (3/2), is a test statistic 

for the segregation/association alternative; rejecting for extreme values of Gj is appropriate, since under 
segregation we expect Gj to be small, while under association we expect Gj to be large. Using the equivalent 
test statistic 

S = \/j(Gj - y)/cj, (1) 
the asymptotic critical value for the one-sided level a test against segregation is given by 

= $-!(«) (2) 

where $(•) is the standard normal distribution fimction. The test rejects for S < z\-a. Against association, 
the test rejects for S > Za. 

Depicted in Figure [6] are the segregation with 8 = 1/16 and association with (5 = 1/4 realizations for 
|3^| ~ 10 and J = 13; and n = 1000. The associated mean domination numbers arc 2.308, 1.923, and 3.000, 
for the null realization in Figure [4] and the segregation and association alternatives in Figure [6l respectively, 
yielding p-values .660, .003 and 0.000. Wc also present a Monte Carlo power investigation in Section 6 for 
these cases. 



Theorem 7: (Consistency) Let J* (a, 



where [•] is the ceiling function and e-depcndence 



is through Gj under a given alternative. Then the test against which rejects for S < is consistent 
for all e e (0, \/3/3) and J > J*{1 — a, e), and the test against which rejects for 5* > Zq is consistent for 
all e e (0,\/3/3) and J > J*(a,e). 

Proof: Let e > 0. Under , 7„^e(3/2) is degenerate in the limit as rt ^ 00, which implies Gj is a 
constant a.s. In particular, for e <E (0, a/3/4], Gj = 2 and for e G (a/3/4, a/3/3), Gj = 1 a.s. as n ^ 00. 
Then the test statistic S = VJ{Gj — ^)/(t is a constant a.s. and J > J*(l — a, e) implies that S < zi_q a.s. 
Hence consistency follows for segregation. 

Under i?^, as n ^ 00, Gj = 3 for all e £ (0, V3/3), a.s. Then J > J*(a,e) implies that S 
hence consistency follows for association. ■ 



Figure 6: A realization of segregation (left) and association (right) for |3^| = 10, J = 13, and n = 1000. 

6 Monte Carlo Power Analysis 



In Figure [71 we observe empirically that even under mild segregation we obtain considerable separation 
between the kernel density estimates under null and segregation cases for moderate J and n values suggesting 
high power at a = .05. A similar result is observed for association. With J = 13 and n = 1000, under Hq, 
the estimated significance level is Si = .09 relative to segregation, and a = .07 relative to association. Under 
^Vs/s' empirical power (using the asymptotic critical value) is /3 = .97, and under H^^^^, [3 = 1.00. 
With J = 30 and n = 5000, under Hq, the estimated significance level is S = .06 relative to segregation, and 
a = .04 relative to association. The empirical power is /3 = 1.00 for both alternatives. 



We also estimate the empirical power by using the empirical critical values. With ,/ = 13 and n = 1000, 



under H^^^ , the empirical power is f3mc - 
power is [3mc — 1-00 at empirical level a,, 
power is [3mc = 1-00 at empirical level a,; 
empirical level dime = .04. 
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Figure 7: Two Monte Carlo experiments against the segregation alternatives H^^^ with 5 = 1/16. Depicted 

are kernel density estimates of Gj for J = 13 and n ~ 1000 with 1000 replicates (left) and J = 30 and 
n = 5000 with 1000 replicates (right) under the null (solid) and alternative (dashed). 



7 Extension to Higher Dimensions 



The extension to for d > 2 is straightforward. Let y — {yi,y2, • ■ • ,yd+i} be d + 1 non-coplanar points. 
Denote the simplex formed by these d+1 points as &{y)- (A simplex is the simplest polytope in R'' having 
d + 1 vertices, d{d+ l)/2 edges, and d+1 faces of dimension {d — 1).) For r £ [l,oo], define the r-factor 
proximity map as follows. Given a point x in S(3^), let y := argminygj; volume((5y(a:)) where Qy{x) is 
the polytope with vertices being the d{d + l)/2 midpoints of the edges, the vertex y and x. That is, the 
vertex region for vertex v is the polytope with vertices given by v and the midpoints of the edges. Let 
v{x) be the vertex in whose region x falls. If x falls on the boundary of two vertex regions, we assign 
v{x) arbitrarily. Let ip{x) be the face opposite to vertex v(x), and ri[v{x),x) be the hyperplane parallel 
to f{x) which contains x. Let d{v{x),r]{v{x),x)) be the (perpendicular) Euclidean distance from v{x) to 
ri{v{x),x). For r G [l,oo), let rir(v{x),x) be the hyperplane parallel to (p{x) such that d{v{x),rir{v{x),x)) = 
r d{y{x), r]{v{x), x)) and d{ri{v{x), x), r]r{v{x), x)) < d(v{x), r]r{v{x), x)). Let &r{x) be the polytope similar to 
and with the same orientation as S having v{x) as a vertex and r]r(v{x), x) as the opposite face. Then the r- 
factor proximity region Ny{x) := &r{x) D &{y)- Also, let (ji^) be the hyperplane such that Q{x) D &{y) ^ 
and rd(yj,Cj(a;)) = d{yj,r]{yj,x)) for j = l,2,...,d+ 1. Then ri{x) = U^+i^ (FJ (x) n R{yj)) where 
riix) n i?(y,) = {ze i?(y,) : d{y,,jfy,,z)) > d(yj, C,(x)}, for j = 1, 2, 3. 

Theorem 1 generalizes, so that any simplex & in R'' can be transformed into a regular polytope (with 
egdes being equal in length and faces being equal in volume) preserving uniformity. Delaunay triangulation 
becomes Delaunay tessellation in R'', provided that no more than d+1 points being cospherical (lying on 
the boundary of the same sphere). In particular, with d = 3, the general simplex is a tetrahedron (4 vertices, 
4 triangular faces and 6 edges), which can be mapped into a regular tetrahedron (4 faces are equilateral 
triangles) with vertices (0,0,0), (1, 0, 0) (1/2, V3/2, 0), (1/2,^3/6,76/3). Let 7„(r,d) be the domination 
number for the extension to R'^. Then it is easy to see that 7„(r, 3) is nondegenerate as n oo for 
r = 4/3, and otherwise degenerate. In R'*, it can be seen that jn{r,d) is nondegenerate in the limit only 
for r ~ {d + l)/d. Moreover, it can be shown that lim„^oo ^(2 < 7n((d + l)/d, d) < d + 1) — 1, and we 
conjecture that 

lim P{d < 7„((<i+ l)/d,d) < d + 1) = 1. 



7.1 Discussion 



In this article we investigate the mathematical properties of a domination number method for the analysis 
of spatial point patterns. 

The first proximity map related to r-factor proximity map, A^^, in literature is the spherical proximity 
map, Ns{x) := B{x, r{x)), (which is called CCCD in the literature, see Priebe et al., 200"T], [DeVinney et al., 2002 



[Marchette and Priebe, 2003| , [Priebe et al., 2003a| , and [Priebe et al, 2003b| ). A slight variation of Ns is 
the arc-slice proximity map NAsi^) '■— B{x, r(x))nT{x) where T{x) is the Delaunay cell that contains x (see 
[Ceyhan and Priebe, 2003a[). Furthermor e, Ceyhan and Priebe introduced the central similarity proximity 



map, Ncs, in Ceyhan and Priebe, 2003a] . The r-factor proximity map, when compared to the others, has 



the advantages that the asymptotic distribution of the domination number 7n (r) is tractable (see Theorem 
3). The distribution of the domination number of the proximity catch digraphs based on Ns or Nas is not 
tractable, and that of Ncs is an open problem. Furthermore, Ncs and Ny enjoy the geometry invariance 
property over triangles for uniform data. Moreover, while finding the exact minimum dominating sets is an 
NP-Hard problem for Ns-, Nas^ and N^g, the exact minimum dominating sets can be found in polynomial 
time for Ny. Additionally, Nas{x), Ny{x), and Ncg{x) are well defined only for x € C//(3^), the convex 
hull of y, whereas Ns{x) is well defined for all x € R'^. 

The Ns (the proximity map associated with CCCD) is used in classification in the literature, but not 
for testing spatial patterns between two or more classes. We develop a technique to test the patterns 
of segregation or association. There are many tests available for segregation and association in ecology 
literature. See [Dixon, 1994[ for a survey on these tests and relevant references. Two of the most commonly 



used tests are Pielou's test of independence and Ripley's test based on K{t) and L{t) functions. However, 
the test we introduce here is not comparable to either of them. Our method deals with a slightly different 
type of data than most methods to examine spatial patterns. The sample size for one type of point (type X 
points) is much larger compared to the the other (type y points). 

The null hypothesis we consider is considerably more restrictive than current approaches, which can be 
used much more generally. The null hypothesis for testing segregation or association can be described in two 
slightly different forms ( [Dixon, 1994| ): 

(i) complete spatial randomness, that is, each class is distributed randomly throughout the area of interest. 
It describes both the arrangement of the locations and the association between classes. 

(ii) random labeling of locations, which is less restrictive than spatial randomness, in the sense that ar- 
rangement of the locations can either be random or non-random. 

Our test is closer to the former in this regard. 
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8 Appendix 



Proof of Proposition 1 

To prove Proposition 1, we show that the expected locus of the boundary of the Fi-region, d{T\{Xn)), goes to 
as n — > cx) by showing that the expected loci of Xe^ are ej for j — 1,2,3. See |Ceyhan and Priebe, 2003b] 
for the details. 

For sufficiently large n and given X^^ = {xj,yj) for j ~ 1, 2,3, 

A(r?/' {Xn))^ \/3/9(3 -6x2 + 2 ^/3^/2 X2 - 2 V3y2 + yz' + 3 + yg' - 2 VSy^ + 3 xg" + 4 yi'-). 
The asymptotically accurate joint pdf of X^j 's is 

/3(C) = n{n -l){n- 2) (V3/36(-2 VSyi + VSy^ - 3x3 + V3y2 + 3 X2)')""V(a/3/4)" 

with the support Ds = {C = (2:1, j/i, 2^2, 2/2, a^3, J/s) S ■ i^j^yj)'^ are distinct}. Then for sufficiently 
large n, E [A(rf^(A'„))] w /^^ A(rf ^(A'„))/3(C)dC, which goes to as n 00 at the rate 0(n-2). See 
[Ceyhan and Priebe, 2003bl for the details. 



Proof of Theorem 3 

We know that 7,i(r) < 3 a.s. for all r S [1, 00] and all n. First we show that lim„^oo ^'(7?i(3/2) >!) = !. 



Note that F(7„(3/2) > 1) = P{Xr, n rl'\X„) = 0). Then we find P{Xr, n rl'\Xn) = 0, £^2(?^,e)) 

2e 
V3 



where E2{n,e) is the event such that < Xi < 1 - and < Zi < e, and 1/2 < X2 < 1 - — 



V3' 

\/3(l - X2) - e < Z2 < 73(1 - X2), and < < 1/2, and V3X3 - e < Z3 < VSXs- First letting 
then e ^ 0, yields the desired result. See [Ceyhan and Priebe, 2003b] for the details. 



n 00, 

Next, lim„^oo ^'(7n(3/2) < 2) = lim„^oo ^'(7n(3/2) = 2), since lim„^oo ^'(7«(3/2) = 1) = 0. Let 
Qj argmin^g;t,^nfl(y,) dix,ej) = argmax^g;t.^n/?(y,) dii{yj,x),ej) 
where Cj is the edge opposite vertex yj for j ~ 1,2,3 and let qj ~ (xj,yj) be the realization of Qj for 

iy^^iQi) U N^^\Q2) or Xn C iV^/'(Oi) U iV^/ 



J = 1,2,3. Then 7™ (3/2) < 2 iff A"™ C N^^^iQi) U N^^^{Q2) or A'„ C iV^/'(Oi) U iV^/'(03) or X^, C 



iv^/'(g2)uiv^/'(g3). 

Let the events E^.j := X^ C Ny^^{Q,) U Ny^^{Qj) for (i, j) = {(1,2), (1,3), (2,3)}. Then 

-P(7n(3/2) < 2) = P(£;i,2)+/'(Si,3)+/'(£^2,3)-/'(£^i,2n£;i,3)-P(£;i,2n£;2,3)-/'(£^i,3n£;2,3)+/'(£^i,2n£;i,3ni;2,3) 

By symmetry, PiEi^2) = ^^(£^1,3) = ^'(^^2.3) and P{Ei^2 H i;i,3) = P{Eia D £2,3) = P(i;i.3 H i;2,3). Hence 

p(7„(3/2) < 2) - 3 [piEi^2) - P{Eia n i;i.3)] + P{Ei^2 n £;i,3 n E2^3)- 

We find P(£'i_2), by finding the asymptotically accurate joint pdf of Qi, (52- Let T{Qj) be the triangle 
formed by the median lines at and y; for k,l j and i{yj, Qj), and let e > be small enough such that 
T{Qj) C R{yj), for j ^ 1, 2, 3. Then the asymptotically accurate joint pdf of Qi, Q2 is 



/i,2(a;i,yi,a:^2,2;2) ^n{n-l) 



{ A{T{y)) - A{T{q,)) - A{T{q2)y ''-^ 



A{T{y)Y V ^(T^CV)) 



where A{T{qi)) = ^/3/36 (-2 V3 + 3 j/i + 3 V3xi) and A(T(q2)) = V3/36 (-3 y2 - \/3 + 3 V3a;2) with 

domain £»/ = {(a-i,yi) G i?(yi) : yi > + VSxi + VSe, (^2,^2) € i?(y2) : y2 < + V3x2 - VSe} 

with e > be small enough such that T{Qj) C R{yj), for j = 1, 2, 3. 



Then P(i?i 2) ~ .4126 (which is found numerically). See Ceyhan and Priebe, 2003b] for the details. 

Similarly we find P(i?i,2 n i?i,3), by finding the joint pdf of Qi, Q2, Q3, where T{q3) is the triangle 
with vertices i (V3 — 3 j/3)\/3, 2/3), (l/2,v^/6), {V^ys^Ua)- Then the asymptotically accurate joint pdf of 
Qi, Q2, Q3 is 



fi23{xi,yi,X2,y2,X3,y3) = n (n - 1) (71 - 2) ■ 



1 fA{T{y)) - A{T{q,)) - A{T{q2)) - A{T{q3)r 



A{T{y))^ V A{T{y)) 

where A{T{q3)) = ^{-VS + 6y3f with domain Dj = {{xi,yi) e i?(yi) : yi > + VSxi + 

VSe, {X2,y2) & R{y2) ■■ y2>-^ +V^X2-V3e, (2:3, ^3) e i?(y3) : y3<^ + £}. 

Then P{Ei^2 n -Bi^3) ~ -2009 (see [Ceyhan and Priebe, 2003b] for the details.) 

Likewise, we find P(-Ei,2 n Ei^3 n £'2,3) « .1062 (see [Ceyhan and Priebe, 2003b] for the details.) 

Hence we get lim„^oo P{-/{Xn, n!/^) = 2) w .7413, and lim„^oo P(7(A'„, nI^^) = 3) w .2587. 



